Topic modelling approaches to aggregated citation data
نویسندگان
چکیده
In this research in progress paper we report on preliminary results from the proposed novel uses of topic modelling approaches to bibliographic references as sources for “bags-of-words” instead of actual text content in scientometric settings. The actual cited references, viewed as concept symbols for paradigmatic approaches to earlier research, could thereby be used to cluster research. We will demonstrate an explorative approach to using cited reference topics for the discovery of hidden semantic reference structures in a set of scientific articles. If found fruitful and robust, this approach could complement existing text based and citation based techniques to clustering of research that might bridge the two approaches. By approaching references as “words” and reference lists as “sentences” (or documents) of such “words”, we demonstrate that the topical structure of document collections can also be analyzed using an alternative and complementary source of content, which additionally provides an interesting perspective on bibliographic references as units of a meta language describing document content.
منابع مشابه
Modelling the Level of Adoption of Analytical Tools; An Implementation of Multi-Criteria Evidential Reasoning
In the future, competitive advantages will be given to organisations that can extract valuable information from massive data and make better decisions. In most cases, this data comes from multiple sources. Therefore, the challenge is to aggregate them into a common framework in order to make them meaningful and useful.This paper will first review the most important multi-criteria decision analy...
متن کاملAdaptive Online Traffic Flow Prediction Using Aggregated Neuro Fuzzy Approach
Short term prediction of traffic flow is one of the most essential elements of all proactive traffic control systems. Although various methodologies have been applied to forecast traffic parameters, several researchers have showed that compared with the individual methods, hybrid methods provide more accurate results . These results made the hybrid tools and approaches a more common method for ...
متن کاملHow Related is Author Topical Similarity to Other Author Relatedness Measures?
Using a dataset of 26,228 Psychology document surrogates from Elsevier databases, we compare author relatedness measure outcomes for 125 authors based on topic modelling to more traditional approaches that rely on direct citation, co-citation and collaboration. Outcomes for the author topical similarity measure are compared to existing co-authorships in the dataset using UCINET/NetDraw. We demo...
متن کاملData Citation: Giving Credit where Credit is Due
An increasing amount of information is being published in structured databases and retrieved using queries, raising the question of how query results should be cited. Since there are a large number of possible queries over a database, one strategy is to specify citations to a small set of frequent queries – citation views – and use these to construct citations to other “general" queries. We pre...
متن کاملMaps on the basis of the Arts & Humanities Citation Index: The journals Leonardo and Art Journal versus "Digital Humanities" as a topic
The possibilities of using the Arts & Humanities Citation Index (A&HCI) for journal mapping have not been sufficiently recognized because of the absence of a Journal Citations Report (JCR) for this database. A quasi-JCR for the A&HCI (2008) was constructed from the data contained in the Web-of-Science and is used for the evaluation of two journals as examples: Leonardo and Art Journal. The maps...
متن کامل